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Abstract 

In market modeling, one often treats buyers as a homogeneous group. In this 
paper we consider buyers with heterogeneous preferences and products available in 
many variants. Such a framework allows us to successfully model various market 
phenomena. In particular, we investigate how is the vendor's behavior influenced 
by the amount of available information and by the presence of correlations in the 
system. 
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1 Introduction 

The standard economics textbooks make the supply-demand law as one of the pillars of the 
modern economic theory. However, many people, especially economists (see for example 
[Ij), gradually realize that the most important factor is missing in the traditional supply- 
demand law. The study of complex systems [21 [3l H] has already led to novel approaches 
to market phenomena. In a previous work [3] , one of us introduced a simple framework to 
treat both quality and information capability, yielding a generalized supply demand law. 
However, in the previous paper, product is simply characterized by a single scalar variable: 
quality. In the modern economy we face a much more complex world, where products have 
many attributes and consumers have heterogeneous tastes [6] . These preferences cannot be 
simply represented as price and quality alone. We therefore generalize the previous work to 
allow multiple variants of each product as well as many different tastes among consumers. 

Thus the producers face a dilemma: whether to target the average taste by producing 
a single or a few variants to leverage the economy of scale, or to match precisely each 
consumer's taste [7]. We shall see that the answer depends on the information level that 
the producers may access: whether they know, and how well they know the consumers' 
preferences. In addition, producers face also the nonlinear production costs. All these 
factors have to compromise to yield a combined result that gives various degrees of product 
diversity. With our approach, the supply demand problem of producers with the capability 
of producing variations and consumers' diverse tastes becomes a matching problem [HI 
[9], where many mathematical and statistical mechanic tools are available to handle the 
complexity of the combinatorial problem. 
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In this paper we build a market model and investigate its behavior under various 
circumstances. In the first part of the paper we do not consider correlations between 
preferences of the parties included in the system. While unrealistic, this assumption allows 
us to discover basic properties of the model and outline the way of reasoning which can be 
used also in later, more realistic considerations. In the second part of the paper we discuss 
correlations and the ways how they can be introduced to the system. The last part of the 
paper deals with the consequences of the correlations for the model. 

2 General framework: one vendor with many buyers 

Let's start with a market where only one vendor and M buyers are present. The vendor 
can produce different variants of a product (e.g. many different shoes). With regard 
to the market, he has to decide which variants it is optimal to produce. We assume that 
all buyers satisfied with the offer buy one item, others stay out of the trading. Buyers in 
the market we label with lowercase Latin letters (i = 1, . . . , M). The different variants the 
vendor can produce we label with Greek letters {a = 1, . . . , A^). The price of variant a 
we label as Pa- We assume that every variant can be produced in as many pieces as it is 
needed and as fast as it is needed. 

The simple structure sketched above offers us enough space to model basic features 
of real markets. To establish a mathematical model for the market we have to introduce 
some assumptions about participants' preferences and their consequences on the trading 
process. To keep complexity of the model at minimum we assume that buyer's opinion 
about a variant can by represented by one scalar quantity, which we call cost and label 
it with x; we assume x e [0; 1]. The smaller is the cost Xi^a, the bigger is the probability 
that buyer i is satisfied with variant a when asked. Preferences of the vendor are easier 
to introduce; they are represented by costs which he suffers during production and sale 
of a particular variant. The cost for variant a we label and after a proper rescaling of 
monetary units i/a G [0; 1]. To simplify our considerations, we arrange the variants in order 
of cost: yi < 1/2 < • ■ ■ < UN- 

We stress a conceptual difference between vendor's and buyer's costs. The seller's 
cost ya is strictly monetary — it represents a real amount of money (although in arbitrary 
units). In contrast, the buyer's cost Xi^^ has no tangible interpretation, it simply represents 
something as airy as happiness with the given variant. 

The vendor is able to produce N different variants. However, when he is producing 
more variants, his expenses grows due to need of an additional investment. The vendor's 
tendency to produce only few different variants can be modeled e. g. by a nonlinearity 
of expenses (doubled production of one single variant does not require doubled expenses). 
We adopt another approach; we assume that to initiate the production of a variant, the 
vendor has to pay additional charge Z > (this represents initial costs). 

Now let's assume that the vendor offered k most favorable variants (thus a — 1, . . . , A;, 
k < N) to customers and the number of units sold of variant a is Ua- The total vendor's 
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Figure 1: One particularly simple choice for the buyers' acceptance function f{x). 

profit is 

k 

X{k, {n,}) = MPc. - Va) - kZ. (1) 

Here the last term kZ comes for the initial costs of k produced variants, Pa — Va is the 
profit for one sold unit of variant a. Due to the monetary rescaling used to confine Ua to 
the range [0; 1], units for profit, initial costs and prices are arbitrary. 

It is natural to assume that when buyer i is asked about interest to buy variant a, the 
decision is based on the cost Xi^a- We formalize this by the assumption that the probability 
of acceptance is a function of the variant cost; this function we call acceptance function. 
Obviously, /(x) is a decreasing function of the cost x. Moreover, we assume /(O) = 1. 
This means that if a buyer considers a variant to be the perfect one, she surely buys it. 

When we offer a random variant to one buyer, the acceptance probability is 

7r(x)/(x)dx = p. (2) 





Here tt{x) is the probability distribution of cost x (i.e. it defines what "to offer a random 
variant" really means). The probability p of accepting a random proposal is an important 
parameter of the model. From our everyday life we know that largely we do not agree to 
such an offer. For this reason we assume p -C 1 in our calculations. 

One example of a reasonable choice for the acceptance function is (see fig. [I]) 

J' - '^l-'f (°^^f2rf- (3) 

^ ' |0 {2p<x). ^ ' 

with p < 0.5. This choice is especially convenient due to its simplicity. If we now assume 
the uniform distribution of the buyer's costs, tt{x) = 1 for < x < 1, parameter p of the 
acceptance function ([3]) is just the probability p of accepting a random offer introduced in 
the previous paragraph. 

In the rest of this paper we assume that the prices of all variants are the same and equal 
to 1, = 1. This relieves us from many technicalities, and helps to highlight important 
features of the model. Nevertheless, generalization to various prices is straightforward. 
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3 No correlations in costs 



We begin our investigation with the simplest case of the presented model — the market 
without correlations, where all costs Ua and Xi^a are mutually independent. We model this 
by costs uniformly distributed in the range [0; 1]. To keep variants ordered, we first draw 
their costs and then we renumber all variants achieve yi < 1/2 < ■ ■ ■ < Un- It follows that 
after averaging over realizations, the formula (y^) = ol/{N + 1) holds. 

3.1 A vendor without knowledge of buyers' preferences 

If a vendor wants to discover which variants are most acceptable for buyers, in a market 
without correlations each buyer has to be asked for preferences. This cannot be done in 
big markets, thus it is natural to investigate the case with no information about buyers' 
preferences on the vendor's side. In sec. 13.41 we show that even an expensive global opinion 
survey brings only a negligible contribution to the vendor's income. 

Without any information about preferences, the vendor is not able to discover which 
variants are most favored by buyers. Therefore the best strategy is to offer variants that 
are most favorable from his point of view. Let's label the number of variants the vendor 
is willing to offer as k. We assume that all these variants are available to buyers simul- 
taneously, similarly to different types of shoes available in a shoe shop. Every buyer goes 
through the offered variants and decides whether some of them are suitable or not. 

From the buyer's point of view, the vendor makes random proposals; the probability of 
accepting one particular offer is thus by definition equal to p. The probability Pa that one 
particular buyer accepts one of k proposed variants is complementary to the probability 
(1 — pY of denying all offered variants. Thus we have 

PA = l-{l-pf ^l-e-^\ (4) 

where the approximation used is valid for pk i.e. for very choosy consumers (than p 
is a small quantity) and a small number of offered variants. Now the average number of 
items sold by the vendor to all M buyers is MP a- Since no correlations are present, the 
average number of items sold of variant a is {ua) = MPA/k, it is a decreasing function of 
k. 

The quantity of vendor's interest is the total profit X introduced in ([T]). Its expected 
value can be found using {ua), {Va), and Pa- We obtain 

Xuik) = M(l - e-'^) (1 - ^(^) - kZ. (5) 

Here the subscript U reminds that we are dealing with an "uninformed" vendor. This 
function is sketched in fig. [2] for three different choices of initial cost Z. The optimal 
number of variants the vendor should offer maximizes his profit. One can easily show that 
when X[j{0) < 0, Xu{k) < for all k > 0. Thus the condition X^{0) < 0, which can be 
rewritten a.s Z > Mp, characterizes a market where the optimal vendor's strategy is to 
stop the production and stay idle. 
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Figure 2: Expected profit of tlie vendor witliout informations, Xij{k), drawn against k for 
small value of initial costs (solid line), medium initial costs (dashed line) and high initial 
costs (dotted line). In the last case the condition Z > Mp is fulfilled and the optimal 
vendc 




Figure 3: The optimal number of offered variants and the optimal profit as functions of 
initial cost Z for M = 500, N = 2 000, and p = 0.05. Numerical results (empty circles) are 
averages over 1000 realizations, analytical results (solid lines) come from eq. For the 
optimal profit arbitrary units are used. 



Since for every product numerous numerous variations can be made, the total number of 
variants the vendor can offer, A^, is large. Thus we are allowed to assume that the optimal 
number of offered variants satisfies the condition fcopt N and solve the maximization 
condition X^{k) = approximately. We obtain 

*op. = iln^, X<.„. = M-^(l + ln^Y (6) 



Z, - -upu 1 ' ry 

P \ Zj 

where Xopt is the optimal expected profit, Xopt = Xu(ka^^- The used approximations are 
valid when Z > MjN and p < 1. 

In fig. [31 these results are shown to match a numerical treatment of the problem. In the 
figure we see how the initial cost Z influences diversity of the vendor's production: decreas- 
ing Z increases differentiation of the vendor's supply in full agreement with expectations. 
We can examine this feature in detail if we plot the optimal number of offered variants 
against Z for one particular realization of the model as it is shown in fig. H] (thickness of 
the lines is proportional to the number of buyers of a variant). 
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Figure 4: Differentiation of the vendor's production for various initial costs Z: single 
realization of the model (no averaging present) with M = 500, N = 2 000, p = 0.05. 
Vertical axis has no significant meaning, it serves purely to distinguish different variants. 

3.2 Improvement of the vendor's profit by a sequential offering 
of variants 

So far we dealt with a very passive approach of the vendor. While offering k variants to the 
market, he had no influence on the sale. In consequence, due to the absence of correlations 
in the system, every offered variant had the same average number of items sold. In a big 
market this is a natural approach. While the use of advertising can promote some variants, 
its treatment exceeds our scope. 

In a small market a personal offering is possible. The vendor can promote favorable 
variants to increase the profit simply by offering the most favorable variant first. If a buyer 
is not interested, the second most favorable variant follows, etc. The average sale of the 
first variant is {n[) = Mp, for the second variant it is (rig) = M(l —p)p and in general we 
have {n'^) — Mp{l — Hence the expected total sale is 

k 

J2 Mpil - pf-^ = M [1 - (1 - p)''] . 

i=l 

This is equal to the expected total sale MP4 of the uninformed vendor in the previous 
section. We can conclude that the vendor's profit improvement (if any) does not come 
from an increased total sale but rather from an increased sale of the variants that are more 
profitable for the vendor. 

Now we investigate the optimal number of variants to offer in this case, k'^^^. Since 
(n^) decreases with a, at some moment it is not profitable to offer one more variant and 
the vendor's profit is maximized. The corresponding equation (n^) = {n'^){ya) + Z can be 
solved with respect to a, leading to k' ^. When the total number of possible variants N is 
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Figure 5: Successive offering: numerical and analytical results for the vendor using stopping 
condition described in the text in the markets with various sizes (on horizontal axis we 
have q = Z/M). All numerical results are obtained as average of 10 000 realizations with 
p = 0.05, N = 2 000; solid line represents eq. [3 



big, {ya) <^ 1 and the term (n^) can be neglected. The approximate solution is then 

"^^^ ^ Hi-p) ■ 

This optimal number of variants to offer is smaller than fcopt given by eq. We can 
also notice that when p is small, using approximation ln(l — p) ^ —p we are left with 
Kpt ~ ^opt- This is an intriguing property — by the two different approaches we obtained 
the same result. To compare k'^^^ in markets with different sizes, we plot it as a function of 
Z/M in fig. [5l As can be seen, in a big market (M > 100 000) eq. ([7]) fits well a numerical 
simulation of the system. 

One can examine also the increase of the vendor's profit caused by the change of the sale 
method. Using previous results, the approximate formula AXopt ~ Z[l+\n{Mp/Z)]/ {Np^) 
can be obtained. We see that when the total number of variants is big, sequential 
offering results in a small growth of the vendor's profit. Nevertheless, in a system with a 
limited offer (small A^) or with very choosy buyers (very small p), the improvement can be 
substantial. 

Here we should notice, that the stopping condition "income greater than expenses" 
introduced above can be hard to use in practice. It is because is a random quantity 
and can drop to the disadvantageous region n'^ < Z + n'^Ua even when (n^) is big enough 
to cover the expenses. Thus for the vendor it is not enough to simply check profitability 
of the sale of one particular variant n'^. Rather he has to take into account sales of all 
previously offered variants. This is especially important in systems with a small number 
of buyers M where relative fluctuations are bigger. This effect is shown in figure [5] where 
numerical results for the vendor blindly using the stopping condition are shown for various 
market sizes. Clearly as M increases, numerical results approach the analytical result ([7]). 



7 




5 10 15 "0 5 10 15 



^1 ^1 

Figure 6: The optimal number of variants to offer (left) and the optimal profit (right) for 
vendor 1 (solid line) and for vendor 2 (dashed line) against Z^. The initial cost of the 
second vendor is = 5.0, M = 500, and p = 0.05. 

3.3 Competition of two vendors 

In real markets we seldom find a monopolist vendor; competition and partition of the 
market is a natural phenomenon. To investigate the model behavior in such a case we 
introduce the second vendor to the market. We assume that the vendors differ by initial 
costs, which are Zi and Z2. Again we do not consider the influence of advertisements and 
reputation, albeit they are vital in a market competition. 

The course of the solution is similar to the one leading to eq. (0). We label the number 
of variants offered by vendor 1 as ki, the number of variants offered by vendor 2 as k2, and 
we assume that there is no overlap between offered variants. The aggregate sale of two 
buyers is MP^ where 

PA = 1 - (1 - P)'''^''' - 1 - exp[-p{ki + k2)]. 

With our assumptions about the equal status of the vendors, every offered variant has the 
same average sale. Therefore both vendors gain the share proportional to the number of 
variants they offer. Thus vendor 1 takes ki/{ki + /c2 of the total sale and vice versa. When 
ki/N,k2/N <^ 1, we can simplify the expected profits to the form 

X,{k„ k2) = Af (1 - e-P('=^+'=^)) - k,Z^, (8a) 

X2{k,, k2) = M(l - e-P^''+'^^) - k2Z2. (8b) 

Both parties maximize their profits by adjusting ki and k2. The corresponding system 
dkj^Xi{ki, ^2) = 0, dk2X2{ki, ^2) = cannot be solved analytically but its numerical treat- 
ment is straightforward. The result is shown in fig. [6] where we have fixed the initial cost 
Z2 to investigate how fcopt and Xopt for both vendors vary with Zi. 

We see that at Zi ^ 12 vendor 1 stops the production for he cannot stand the 
competition of vendor 2. By putting ^2 = in the equations 9fc^Xi(/ci, ^2) = and 
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^^2^2(^1, = we obtain the expression for the value Z* when this price-out occurs 

Mp ( Z^ 
^ ln(Mp/Z2) V Mp 

It is in a good agreement with the values found by a numerical simulation of the model. 
Important feature of this result is that it depends on the initial price Z2 of the competitive 
vendor — decreasing the production costs can expel others from the market. 

One can notice that when vendor 1 tries to increase the profit by deliberately increasing 
ki (with the intention to increase the sale), the term —k^Zi prevents the success of this 
strategy. As a result, the vendors have to adapt to each other. In mathematic terms, 
-^i(^iopt, fc2opt) > Xi{ki^ k2opt)- At the same time, the sum of profits is not maximized at 
/ciopt and /i;2opt- It is more profitable to remove the less efficient producer (the one with the 
higher value of initial costs). This is an analogy of a real market where ruining (or taking 
over) of a competitor can improve company profit. 




3.4 An informed vendor 



Now we would like to investigate the artificial case of the market where the vendor knows 
costs Xi^a of all buyers. This knowledge can be used to increase the optimal profit. We 
start with a simpler question: if the vendor offers only one variant, how much the sale can 
be increased by a good choice of the variant? 

The probability that buyer i is agreeable to buy variant a is f{xi^a)- Since costs Xi^a 
are random and independent, only the average acceptance probability p plays a role and 
the number of users willing to buy this variant, Ua, is thus binomially distributed with the 
mean (ua) — Mp and the variance cr^ = Mp{l —p). When the number of buyers M is big, 
we can pass to a continuous approximation and assume the normal distribution of 
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(10) 



The biggest value from the set {ria} (a = 1, . . . , A^) we label as m. This is the number 
of potential buyers for the most accepted variant and the vendor does the best when by 
offering this variant. The probability density /Ar(m) (often called extremal distribution) is 



fN{m) 



N 



27r(7 



exp 



(m — Mp) 
2a^ 



21 



- + - Erf 
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m — Mp 

(T\/2 



N-l 



(11) 



The multiplication by appears because we do not care which one of all variants is 
"the most accepted" one and the error function term represents the probability that the 
remaining A^ — 1 variants are less accepted. 

Since we are interested in big values of A", we expect that the difference (m) — M is big 
in comparison with a. Therefore we use the approximation Erf(x) 1 — exp[— x^I/Vttx^, 
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Figure 7: The relative growth of the vendor's sale 6 is drawn against the total number of 
buyers M. Solid lines represent the analytical result, outcomes from numerical simulations 
are shown as symbols. 



which is valid for x ^ 1. When the error function value is close to one, we can also use 
the approximation (1 — x)^ ~ exp[—Nx] {x <C 1) to obtain 



jN[m) ^ exp 



(m - Mpf aN exp [ - (m - Mpy/2a^] 



2(t2 ^/27l m - Mp 

This form is too complicated to obtain an analytical result for (m). Instead we compute 
the most probable value m 

I 

m ^ Mp + (TW 2 In 



271 

Here the first term Mp represents the average value of the sale and the additional term 
represents the gain arising from the additional vendor's knowledge. To get a better notion 
about the sale growth we use the relative sale growth 



, m-Mp / 1 pMN^ , , 

Mp ]l Mp 2tt ^ ^ 

To simplify the formula, the assumption p <^ 1 has been used. A comparison of this 
result with a numerical simulation of the model is shown in fig. [3 As can be seen, a good 
agreement is obtained. 

When 5 <C 1, all vendor's information is indeed useless and the average sale improve- 
ment is negligible. The inequality 6 leads to the condition 

Ar2«|ZLe^f. (13) 
Mp 

Thus when the number of variants is not large enough, buyers' opinions in the uncorrelated 
market cannot be used to increase the vendor's sale and profit. 

From the previous results we can draw useful implications about the vendor with perfect 
information, offering more than only one variant. When the total number of variants is 
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big, the number of variants offered by tlie vendor is small in comparison with A^. Therefore 
the average sale of all offered variants is increased at most by 6 given by eq. (1121) and the 
same applies to the total sale. However, the vendor is interested mainly in his profit. When 
we take into account different costs ya of variants, the resulting growth of the income due 
to the informations is even smaller than 6 because the variant with the highest sale can 
have a high cost for the vendor. Thus condition ffT^ is has more general consequences. It 
specifies the circumstances when even the perfect information about buyers' preferences 
do not help the vendor to achieve a significant improvement of his profit. 



4 Correlations in the system 

Now we would like to add one important flavor to the model — correlations. They arise 
from conformity of people's tastes (buyer-buyer correlations) and from the fact that high 
quality preferred by buyers results in high costs on the vendor's side (buyer-vendor anticor- 
relations). To approach the behavior of a real market, we investigate how these correlations 
influence our results obtained so far. Before doing so, we briefly discuss correlations from 
a general point of view. 



4.1 Measures of correlations 

A correlation is the degree to which two or more quantities are associated. We shall discuss 
different ways how to measure correlations and how to introduce them to the system. In 
particular, we would like to measure the correlation between two lists (vectors) of costs: 
Xi and Xj (two buyers) or Xi and y (a buyer and the vendor). All lists of our interest have 
length A^ and contain real numbers between and 1. A common choice for the correlation 
measure is Pearson's correlation coefficient r. For lists x and y it is defined as 



2 

Ea=l(a;a -^)(ya - y) 



Ea=l(3^" - Xy Ea=l(l/a - y) 



(14) 



This measure is sensitive to non-linear transformations of values in lists x and y. In 
addition, since it originates in the least-square fitting of the data by a straight line, it 
measures only a linear correlation. For this reasons, in this work we use another correlation 
measure, Kendall's tau. For lists x and y it is given by the formula 

= N(N-l) X]^"/^' aa/3 = sgn[{xa-Xf3){ye,-y(3)] (15) 

and it ranges from -|-1 (exactly the same ordering of lists x and y) to —1 (reverse order- 
ing of lists); uncorrelated lists have r = 0. Notably, Kendall's tau is insensitive to all 
monotonic mappings of the data. This is the strongest property we can expect from a 
correlation measure — more general transformations, nonmonotonic mappings, can sweep 
out any structure present in the data. 
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Figure 8: The upper bound Mm as a function of tq for two different lengths of hsts A^. In 
both cases the upper bound Mm is the same over a large part of the region tq < and 
drops to 1 when tq — > 1. 



4.2 Lists with a given correlation degree 

Now we would like to construct a set of lists that have mutual values of Kendall's tau 
equal to tq. Such a set would represent lists of buyers' preferences in an equally dispersed 
society. Since buyers' tastes are to a certain extent similar, we expect positive correlations 
with To > 0. Nevertheless, in the following discussion we do not confine ourself to this 
region. 

First we address a different question. Let's assume that between lists 1 and 2 there is 
Ti2, between lists 1 and 3 there is ris. Does it imply any constraints on r23? The answer 
is yes. It can be shown (see Appendix that T23 fulfills the inequality 

ki2 + nsl - 1 < ^23 < 1 - In2 - nsl, (16) 

which is an analogy of the triangular inequality for side lengths of a triangle. From ffTUj) 
we can draw various simple conclusions. First, if we want to construct three lists which 
have pairwisely tq, it is possible only for —1/3 < tq < iQ Thus it is impossible to have 
more than two lists which are perfectly anticorrelated. Another simple result is that when 
ri2 = -1, inevitably T23 = -n^. 

Now the question is whether we are able to create the whole system of M lists which 
all have pairwisely Kendall's tau equal to tq. The answer depends on the magnitude of M. 
It can be shown (see appendix [C]) that the upper bound for M is 

fo , 1„„ (l-ro)(jV-l)jV / . p^ 

_ I2 + I0g2 4 (T-o>Uj, 

- \min [2 + log2 (izZIlMzM, 2 log^ ^] (r, < 0). ^''^ 

As can be seen in fig. [8], this quantity grows slowly with the list length A^. Therefore 
to model a market with a large number of equally correlated buyers we would need an 
enormous number of possible variants. 



-'^An example for lists with the pairwise value tq = —1/3: xi = {3, 2, 1}, X2 = {2, 1, 3} and X3 = {1, 3, 2}. 
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4.3 Generation of correlated lists 

In the previous paragraphs we found that the society with a fixed mutual correlation degree 
of buyers is limited in its size. Therefore to introduce correlations to the presented market 
model we need a different approach. While copulas represent a general tool (see e.g. [ini 
[TT]). they are useful mainly for numerical simulations and offer only small possibilities for 
analytical results. Here we adopt a simpler way to generate correlated lists. 
Let's consider the lists of variables 

Xi,a = - t) ai^a + t Ca, (18a) 

y^ = {l-t)b^ + stc^ + ^{l- s), (18b) 

where Oj^Q,, ba and are independent random variables uniformly distributed in the range 
[0; 1]. Here s = ±1 indicates correlation/anticorrelation between Xi and y and t G [0; 1] is 
the binding parameter controlling strength of the correlation: t = leads to uncorrelated 
lists, t = 1 to perfectly correlated (s = 1) or anticorrelated (s = —1) hsts. In all cases, 
values Xi^a, Va lie in the range [0; 1]. 

For the lists defined above, it can be shown that (see appendix [B|) 

{,'Txy) S {Txx) {t'xx) 

where t/{l — t) = u. Plots of (r^^) and (r^y) are shown in fig. [HI Since buyers' lists are 
prepared using the same formula, the average value of their correlation is non-negative. 
Notably, for any value of r we can find suitable s and t that produce lists with the expected 
correlation equal to r. 

Lists created using eq. (fT^ do not have fixed mutual correlation, its actual value fiuc- 
tuates around the mean value given by (1191) . According to the law of large numbers, /(r) 
is normally distributed. In appendix [B] it is shown that the variance of r is proportional 
to 1/A^. Such fiuctuations are negligible for long lists. We can conclude eq. (ITSll present a 
way to create a system with the desired amount of correlation r for any r, M and N ^ 1. 

Yet there is a hitch in the proposed construction of correlated lists. The parameter t 
infiuences the distribution of costs: for t = or t = 1 they are distributed uniformly, for 
t = 0.5 f{x) has a tent shape. This is an implausible property: the changes of the cost 
distributions can drive or distract the phenomena we are interested in. To fix this problem 
we propose the following two solutions. 

First, to obtain correlated lists we can use the formulae 

1 fa-l 1\ , f \\ a-\ , , 

a;i,a = - + I y-- 1 +(l-t) I Si,„-- 1, V<. = ^^-^, (20) 

where is a random quantity distributed uniformly in the range [0; 1]. The complicated 
form of the Xi^a has a simple meaning. The vendor's costs grow uniformly with a and 



(10-6m + m2) (m<1) 
(15-^ + ^) («>1) 



(19) 



13 



buyers' costs are connected to the vendor's by the parameter t G [0; 1]. The term propor- 
tional to 1 — t introduces a noise to the system, resulting in differences between buyers' 
and vendor's lists. Finally, the term 1/2 represents the average value of buyers' costs. It 
is easy to check that Xi^a given by (l20l) is confined to the range [0; 1] for every t G [0; 1] 
and s = ±1. The overall distribution of costs is uniform in the range [0; 1] and thus we 
avoid the problems of eq. ffT^ . Moreover, this construction is simple enough to tract the 
proposed model analytically. 

Using the techniques shown in appendix [B] we can find Kendall's tau in this case. In 
the limit N oo one obtains 

fe.) = |f(^° + <"^^'' (21b) 
U(l5-i* + *) (->1). 

where again u = t/{l — t). The form of (t^x) is identical with (fT9l) found before for a 
different construction of correlated lists. 

As we will see later, eq. f l^Ul) is not appropriate to produce anticorrelated lists. Hence 
we present one more approach here — less accessible to analytical computation but more 
robust. The normal distribution is stable with respect to addition of random variables and 
this motivates us to make the following choice 

Xi,a = Vl -tai^a + Vica, (22a) 
Ha = — t ba + sVi Ca (22b) 

where Oj^o, ba, Ca are drawn from the standard normal distribution. It can be showii^ that 
in this case 

2 

{txx) = -arcsint, {Txy) = s (r^^). (23) 
vr 

The course of (txx) is shown in fig. 

As our market model assumes costs confined to the range [0; 1], the costs given by (!22l) 
have to be transformed using the cumulative distribution function of the standard normal 
distribution $(a;). In this way we obtain 



Xi , 



$"'(a:.,a), Va = <^-\ya)- (24) 



Since this transformation is monotonic, it does not affect the value of (r) and we can use 
eq. (l23l) for transformed lists of costs. 



^In the derivation, the following formula is useful 

1 



e Erf (ax) Erf (6a;) dx = arctan ab/Jp{a'^ + 1}^ +p) 
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Figure 9: The dependence of (r) on t for the proposed constructions of correlated hsts. 

5 A market with correlations 

When we discussed the market without correlations, the probability distribution of the 
variant cost iTa{xi^a) was independent of a. Consequently, the probability of accepting 
variant a 



was also independent of a (we labeled P4 = p). As a result, when we change the acceptance 
function f{x) while preserving the quantity 7r{x)f{x)dx, the derived results remain 
unchanged. 

In the presence of correlations we witness a very different picture: the detailed shape 
of the acceptance function /(x) is important. To keep the algebra as simple as possible, 
from now on we adopt the simplest choice for / (x) : the step function f{x) = 1 — 0{x — p). 
This means that a buyer accepts a proposed variant only when its cost is smaller than p. 

In the following we first deal with the market where costs are given by eq. (I^Uj) for it is 
more accessible to analytical treatment. Then we shortly present analytical results for the 
market where to introduce correlations, eq. (l22l) is used. 

5.1 An uninformed vendor in a market with correlations 

Here we assume cost correlations created using eq. ( !20l) . When the vendor has no informa- 
tion about the preferences of buyers, similarly to sec. 13.11 the best strategy is to produce 
vendor's most favorable variants. First we focus on the case of positive correlations; in 
(!20|) we set s = 1 and < t < 1. Using (l25l) and the chosen step acceptance function /(x), 
the probability that one buyer accepts variant a is 




(25) 




1 

i-t 



l + (Ar-l)2 <a, 

[p-tjT^] l + {N-ir-±^<a 
a<l + (iV-l)2±p. 




t ' 



(26) 
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Since we expect the total number of variants to be very large and p rather small, the 
second region makes the major contribution and thus we simplify eq. (1261) to PA(ct) ~ 
{p-ta/n)/{l-t). 

We assume that the vendor is simultaneously offering his k most favorable variants. 
The probability that one buyer denies all offered variants is 



k ^ / 

PD{k) = X[[l-PA{a)\=X[[l--^^ 

a=l a=l ^ 



ta 



N{l~t + p) 



(27) 



Since is big, we use the approximation 1 
analytically, leading to 



X 



exp[— x] to evaluate this expression 



PD{k) 



p 



exp 



- 1 
pk 



k k 

JJexp 

a=l 
+ 



ta 



N{l-t + p) 



tk^ 



1-t 2N{l-t + p) 



(28) 



Here we used also 1 — p/{l — t) ~ exp[— p/ (1 — t)] which is valid when p/ (1 — t) is small. 
When this is not the case, denying probability Poik) approaches zero and thus accepting 
probability is virtually one regardless to the approximation used. 

With respect to ( 120|) . the sum of expected vendor's costs can be written as 



k 

a=l a=l 



(29) 



Here the first term represents fixed costs for producing k different variants, Ps{oi) is the 
probability that to one buyer variant a is sold. Since the probability that of the successful 
trade is 1 — Poik), from the condition X]a=i Ps{oi) = 1 — Poik) we can deduce 



Ps{a) 



PA{a) 



ELiPAia] 



[1-Pnik)]. 



(30) 



This corresponds to the portioning of the probability l — Poik) among k variants according 
to their probability of acceptance. 

Now we can use ( !2H|) . and (IHUl) to write down the expected profit of the vendor 
offering his k topmost variants X{k). It's not possible to carry out the maximization of 
this expression analytically — numerical techniques have to be used to find fcopt and Xopt- 
Results are shown in [10] as lines together with outcomes from a numerical simulation of 
the model; a good agreement is found for st > 0. Results confirm that positive correlations 
between buyers and the vendor increase the vendor's profit. This pattern is most obvious 
in the case t = 1 when the vendor can offer only the most favorable variant and still every 
buyer buys it. 
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Figure 10: The optimal number of variants to produce (left) and the optimal profit drawn 
against st for two different values of the initial cost Z. Lines show analytical results 
derived above, symbols represent numerical simulations (averages of 1000 realizations), 
model parameters are set to = 2 000, M = 500, p = 0.05. The decay of both quantities 
for st < is in agreement with eq. (1311) . 

In the numerical results shown in fig.UHlwe can notice one striking feature. When st < 0, 
fcopt changes rapidly and Xopt falls to zero quickly. Such a behavior is rather surprising 
for one do not expect abrupt changes in the region st < when there were none in the 
opposite region st > 0. The reason for this behavior is simple — when s = — 1, vendor's 
most preferred variants have cost too high to be accepted by a buyer. This effect can be 
quantified. When buyers' costs are generated by (1201) . the inequality Xi^a > t{N—a) / (N—l) 
holds. Due to the acceptance function only the variants with cost smaller than p are 
accepted. Therefore only variants with a > amin can be possibly accepted, where 

a^in = 1 + (iV - 1) ^ ~ iV(l - p/t). (31) 

Thus with negative correlations in the market, the vendor is able to sell the most favorable 
variant (the one with a = 1) only ii p > t. When p < t, the vendor sells no variants 
a = 1, . . . , amin — 1- Since amin grows steeply with t (already with t = 2p one obtain 
Q^min = -^/2), the vendor offering his top k variants has to offer too many of them and he 
suffers both big initial costs and big costs i/a- As a result the vendor is pushed out of the 
market. 

Without detailed investigation we can infer the system behavior when the step ac- 
ceptance function is replaced by a different choice. In the limit of careless buyers with 
/(x) = C, the influence of correlations vanishes and both fcopt and Xopt do not depend on 
st and the model simplifies to the case investigated in sec. 13.11 Thus as /(x) gradually 
changes from the step function to a constant function, the dependence on st gets weaker. 
In particular, if the largest cost x for which /(x) > is xq (for the step function xq = p), 
in eq. (13T!) p is replaced by Xq. As a consequence, amin = decreases and the steep decline 
of Xopt in fig- [ini shifts to a lower value of st. 
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Figure 11: The optimal number of variants and the optimal profit of the uninformed vendor 
in the market with correlations given by ( l22l) and ( 12^ . Numerical results are averages of 
1 000 repetitions, p = 0.05, = 2 000, M = 500. 

5.2 An uninformed vendor in a different market with correlations 

Now we switch to the market costs drawn using eq. (|22l) and transformed to the range 
[0; 1] by eq. fl2^ . As we already mentioned, this case is not allowable for an analytical 
treatment — hence we present only numerical results in fig. [TTJ They agree with our ex- 
pectations: when st = 1, for the vendor it's sufficient to produce only one variant. As the 
positive correlations diminish, the optimal number of offered variants grows and the profit 
shrinks. A closer investigation of the vendor's behavior in this case exceeds the scope of 
this paper and remains as a future challenge. 

6 Another trading model 

In previous sections we presented a way how to deal with the trading process. Here we 
shortly present a different model which arises from the same playground as our previous 
reasonings but highlight slightly different aspects of the market phenomenon. 

Let's have a market with M buyers and A^ different variants that the vendor can 
produce. Preferences of the interested parties are again represented by the scalar costs 
Xi,a, 2/a = 1, ■ • • , M, a = 1, . . . , N) uniformly distributed in the range [0; 1] (thus again 
we have no correlations in the system). 

In a market, a vendor is aware that when some buyer is not satisfied with the offer, 
she can choose a different vendor. Therefore every vendor tries to induce as small cost as 
possible to the customers. This can be done by offering of the variants highly preferred 
by many buyers. We can visualize the process by sorting preference lists of all interested 
parties. Now when some variant is near the top of a buyer's list, its cost is small and it is 
favorable for this buyer. 

We have to specify the criterion for the "variant preferred by many buyers". First, it 
can be the variant that is not too deep in nobody's list. Thus, if we label the position of 
variant a in the list of buyer i as ki^a and maxj ki^a as ba, the vendor chooses the variant 
a that has the smallest ba- The selection process is visualized in fig. [121 Now the question 
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Figure 12: The layout of the introduced trading model. Each column represents a sorted 
list of variants' costs (the most preferred at top). The vendor is willing to go down his list 
by (i, in consequence buyers are forced go down by some value h. Questionmarks signalize 
that after sorting of all lists we do not know standing of variants in the lists. 



vendor buyer 1 buyer 2 



m 


7 




? 


m 


7 




? 



is: how far buyers have to go down their lists? In other words: if we label h = min^ ba, 
what is (6)? 

Since we have M buyers in the market, the probability of a particular value b is ap- 
proximately given by the formula 



P(6) 







d 








1- (';■)"! 


X 














kn) _ 



(32) 



Here the first term denotes the probability that there is no such a variant which is among 
topmost 6—1 for every buyer and among topmost d for the vendorjfl The second term 
responds to the fact that there is some variant which is exactly on 6th place in the list of a 
buyer (we do not care who it is, thus the multiplication by M appears), among 6 topmost 
variants in lists of other buyers and among d topmost variants in the vendor's list. 

In eq. f lH2|) we can use the approximation (1 — x)^ ~ exp[— xrf] which is valid when 
a; -C 1. To calculate (6) = J2^=i bP{b) we replace the summation by the integration in the 
range [0; oo] which yields 

,33) 

Here we dropped terms vanishing in the limit N ^ oo (for there is a big number of variants 
that the vendor can produce). 

Since there are no correlations in the system, when the vendor offers his d topmost 
variants, every variant has the same probability to be chosen by a buyer. Thus the vendor 
has to go down his list on average by (1 + d)/2. On average this corresponds to the cost 

iv) = (34) 

A buyer is with the probability 1/M the one that has the sold variant on the 6th place 
of his list. With the complementary probability 1 — 1/M he has this variant somewhere 



^Since in one list each variant appears only once, this is only an approximate form of the probability. 
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Figure 13: Numerical and analytical results for {x) plotted against d for various values of 
M [N = 1000, numerical results are averages of 1000 repetitions). The analytical result 
for (y) is shown by the broken line. 




between the 1st and 6th place. Thus we have 

^ ' N\M^ ' M 2 J 2M2 ^ ^ 

When the number of buyers M is large, this approaches | d~^^^^ . A comparison of results 
(IMll and ( |35ll with numerical simulations is shown in fig. [131 a good agreement is found. 
A small discrepancy for M = 1 can be corrected using the result (y) ~ l/{d+ 1) which we 
develop in the following section. 

To discover the scaling behavior of (b), one can follow a shorter path. The probability 
that one particular offered variant is among topmost b in one buyer's list is b/N. For 
all buyers simultaneously the probability is {b/NY^ . Since the vendor offers d items, the 
probability that at least one of them is above the line is approximately d{b/N)^ . If this 
equals to 0(1), the trading is successfull. Thus we obtain (6) = N0{1) d~^/'^ . This result 
scales with M and d in the same way as the previous outcome of the detailed derivation. 

At the end we have to mention that this model of transactions between the vendor and 
buyers is not relevant for a high number of buyers because (x) decreases very slowly in this 
case. In other words: when M is high, the probability that there is a variant that is not 
worst in any buyer list approaches zero. 



6.1 A vendor producing more than one variant 

From the previous discussion we know that in a big market vendor can not insist on selling 
only one variant. Therefore we would like to investigate a more relaxed case where the 
vendor offers simultaneously d different variants. Then every buyer can choose the one 
the most suitable for him or her. We would like to investigate, how much suffer buyers 
in this case. To do so, we label the most favorable variant from the vendor's offer for one 
particular buyer b and compute the average value of this quantity. 
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The probability that one particular value b occurs is 



P{b) 



N -b + 1 



d 




N-3 + 1 



d 



(36) 



In this formula the product represents the probability that all d variants offered by the 
vendor are in the buyer's list lower than & — 1, the first term represents the probability 
that one of offered variants is on 6th place in the buyer's list. Now we can derive (6) 



Consequently, the average cost suffered by a buyer is given by {x) = {b)/N ^ l/{d + 
1). We see that M does not appear in (x). This means that the problem with the 
improper behavior of the model in big markets do not appear in this variation. Since in 
the calculation we did not make any approximations, no numerical simulation is needed to 
check the result. 



The aim of this paper is to explore the modeling of a market with inhomogeneous buyers 
and a vendor producing multiple variants. We see that the outcomes depend on whether 
one or both sides have adequate information about the other side or not. In standard 
microeconomics, Pigou [T^ has introduced the concept of price or demand elasticity. Ven- 
dors, knowing the buyers' reserve prices to pay and thus pricing individually, can reap 
significant profit — this is usually called the first-degree price differentiation. Our analysis 
can be considered as a generalization in this direction. We show that if individual tastes 
are taken into account, there is much complexity in the system; treating individual tastes 
with a large number of buyers presents a considerable mathematical challenge. Our models 
point out a convenient way to tackle this type of problems and we expect that many real 
economy-motivated problems can be analyzed in a similar way. Vendors and buyers have 
many ways to improve their welfare. 

In this study we have proposed two simple market models. While accessible to analytical 
solutions, they exhibit many features of real markets. In particular, diversification of the 
vendor's production and market competition are used as examples. The diversification is 
presented as an interplay between the vendor's pursuit to follow the buyers' tastes and 
the costs growing with the number of produced variants. We also show that in a market 
with many buyers without preferences correlations, the knowledge of these preferences 
doesn't increase the vendor's profit. When correlations are introduced to the system, many 
technical complications arise. Nevertheless, the results are consistent with the expectations: 
a positive correlation between the buyers' and vendor's costs improves the vendor's profit. 
Also, when interests of the two parties diverge (the correlation are negative), the vendor is 




(37) 



7 Conclusion 
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able to make only a small or even no profit. In addition, in sec. EJa similarly aimed model 
based on the well-known matching problem is investigated. 

As many other directions can be explored further, we do not consider this topic ex- 
hausted. First of all, in a correlated market the vendor strategies and the influence of 
information deserve attention. Furthermore, while in the present work we investigated the 
influence of tastes on the market, the product quality and price were excluded from the 
analysis. Eventually, the framework established herein can be used to raise the law of 
supply and demand from a microscopical point of view. 
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A Proof of r-inequality 

Let's have three lists x, y, z consisting of mutually different real numbers. Kendall's r 
for lists X and y can be written as r^y = {Pxy — Nxy)/T where Pxy is the number of pairs 
a < (3 that satisfy (xq, — x^){ya — yp) > 0, N^y is the same with a negative result of the 
product, and T = A^(A^ — l)/2 is the total number of different pairs a, (3. For the given 
values Txy and it follows that 

P,y = T(l + T,y)/2, N^y = T(1 " T,,y)/2. (38) 

We would like to find bounds for Tyz when r^y and t^z are given. First we reorder lists 
x,y,z so that hsts x is sorted in the descending order and for a < /? it is Xa — Xjs > 0. 
Such a rearrangement does not affect the values of Pxy, Pxz, Pyz, Nxy, Nxz, Nyz and thus the 
values of Kendall's tau between lists remain also unchanged. 

Since now all differences positive, from Txy we can deduce that there are Pxy 

positive differences y^ — yp and Nxy negative differences. Similarly, P^z differences — zp 
are positive and N^z are negative. The values of Pyz and Nyz depend on the relative 
ordering of lists y and z. The biggest possible value of Pyz occurs when positive differences 
ya — yp are aligned with positive differences Za — zp (see fig. [Hj). By contrast, the smallest 
value of Pyz (and thus the smallest value of Tyz) occurs when positive differences y^ — yp 
are aligned with negative differences z^ — zp. 

From fig. [H] we see that Py^ and Nyz fulfill the inequalities 

/V >|P -A^l P > \P -PI 

yz — \-'^ xy ^^xz\) yz — {-^ xy ^ xz\- 



Using Pab + Nab = T and fl58|) we obtain 



Z\t- — t- I <P <c t — -\t +r I 

2 \ ' xy ' yz \ —-^ yz — 2 ' ~ V^'^ 



—T-\-^\t —T \< — M < — -It -\- T I 
~ 2 ' •''y yz\ — ^"-^yz — 2 ' ■'^y ~ yz ' 

These two inequalities summed together and divided by T yield the desired inequality (fTB|) . 
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Figure 14: An illustration of the proof. The first case (first three lines) has the biggest 
possible value of Pyz, the second case has the smallest possible value of Py 



yz- 



B Expected values of (r) 

For lists created using ( |T8l) we can rearrange ( |T5l) as follows 

2 



Moreover, can be rewritten as 

{a^p) = P++ + P--P-+- 



a</3 



Pi _ = 1 - 2P : - 2P, = 1 - 4P. 



Here P+4. is the probability that both — and x'^ — x'^ are positive and so forth, the 
formulae P+_ = P-+, P++ + P + P-+ + P+- = 1 are used. According to ( fTSj) we write 

Xa - XjS = {I - t){aa - ap) + t{Ca - Cp) = (1 - t)A + tC, 



;i - t){h^ - hp) + t(c„ - cp) = {l- t)B + tC 



where A, B, C lie in the range [—1; 1] and are equally distributed with the density q{A) 
l-\A\. Now we have - t) = u) 



Pi+-\C) 



i{ucr[i-i{ucr] {c<i/ 



u 







{C > 1/u). 



If M < 1, the first case applies to all possible values of C, P(H — \C) = is possible only if 
u > 1. Finally, using 



with g{C) 



P{+-) 
\C\ it follows that 



j\{+-\C)giC) dC 



f^(lO-6M + M2) (m<1). 
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Figure 15: Construction of the set of lists with the given value of Kendall's tau for T = 10 
(thus N = 5) andW = A (thus tq = 0.2). 



The quantity (Txy) can be derived in the same way. 

The variance of be found by a direct computation of (t^s.). We have 



T _ 1)2 



The averaging procedure is straightforward. At the end we obtain 

4 



N 



where terms proportional to higher powers of 1/A^ were neglected. The variance is largest 
when t = 0, for t = ±1 obviously 0",- = 0. 



C System's upper bound with a given tq 

To obtain the upper limit M^, for the number of lists that have pairwise Kendall's tau 
equal to tq we use a constructive way of reasoning. Without a loss of generahty we assume 
that in the first list all A^(A^ — l)/2 = T differences Xa — (a < f3) are positive. From 
eq. (l38i) it follows that to maintain tu = tq, in the second hst exactly A''2 = (1 — tq)T/2 
pairs have to be negative. The same holds for all other lists we would like to add to the 
system, Ni = (1 — tq)T/2 = W is required for i>2 {Ni = is fixed by the chosen ordering 
of the first list). 

To represent the way of construction we use fig. [151 The first list is represented by T 
positive pairs there, second one hy T — N2 positive pairs and by N2 negative pairs. We 
would like to find a third list satisfying ri3 = = Tq. We already now that = W 
negative pairs have to be accommodated there. To achieve = Tq it is necessary that 
exactly half of the negative pairs in the second list meets with negative pairs in the third 
list. This can be fulfilled in two ways (lines 3 and 4 in fig. [T5i) which also have mutually 

T34 = To- 
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Now there are two important points to notice. First, the construction of the third and 
the fourth hst is impossible when A^3 is an odd number. Thus for corresponding values 
of To there are not more than two lists that have mutually Kendall's tau equal to this tq. 
Second, we have used purely combinatorial arguments here without taking care whether 
described set of lists (e.g. lists 1, 2, 3 and 4 from fig. [151) do exist. Thus we estimate an 
upper bound which cannot be exceeded but which can dwarf the real maximum by far. 

Now we can continue with the fifth list where again = W negative pairs are present. 
Among them exactly have to meet with negative pairs in the second list and also half 
of them have to meet with negative pairs in the third and fourth list. This can be achieved 
by lists 5 and 6 in fig. [15]), their relative Kendall's tau is also tq. 

During the construction process we divide W negative pairs present in the second lists 
into two groups with size W/2 (lists 3 and 4), then we divide again and obtain lists 5 and 
6 with groups of negative pairs with size W/ 4. Clearly, this sequence ends when we divide 
the original number of negative pairs W so many times that we arrive at 1 which cannot 
be divided further. Consequently, the upper bound we are looking for is the following 

(l-To)(N -1)N , , 

M^^2 + 21og2^^; = 2 + 21og2^ —. (39) 

This formula holds when Tq < 1 (for Tq = 1, clearly = 1)- 

In the previous construction, there is one hidden flaw. Since only Wjl negative pairs 
in third list meet negative pairs in second list, to prepare lists 1, 2 and 3 we use altogether 
W + W/2 pairs. If this number is greater then T, the third list cannot be constructed in the 
described way and the limiting Mm differs from the previously found result. To construct 
lists 5 and 6 we need together W + W/2 + W/4 pairs which has to be smaller then T. By 
generalizing previous argument we can write down the inequality for the maximum number 
of lists Mfn 

W + W/2 + ■■■ + iy/2*^'"/2"i < T. 
For Mm itself it follows that 

M„<21og2^^. (40) 

This limit is relevant only for tq < (with tq > we have W <T/2 and W + W/2 + . . . 
is less then T). As an actual upper bound, the smaller value from f[39|) and pUj) applies. 
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